Neural Machine Translation with Synchronous Latent Phrase Structure
نویسندگان
چکیده
It has been reported that grammatical information is useful for machine translation (MT) tasks. However, the annotation of incurs significant human costs. Furthermore, it not trivial to adapt MT because usually employs tokenization standards might capture relation between two languages and consequently, subword such as byte-pair-encoding used alleviate out-of-vocabulary problems; however, this be compatible with those annotations. In work, we introduce methods incorporate without supervising explicitly: first, latent phrase structure induced in an unsupervised fashion from attention mechanism; second, structures encoder decoder are synchronized so they each other using constraints during training. We demonstrate our approach performs better tasks: word alignment, extra resources. found enhance precision alignments through synchronization constraint after exact alignment analysis.
منابع مشابه
Neural Machine Translation with External Phrase Memory
In this paper, we propose phraseNet, a neural machine translator with a phrase memory which stores phrase pairs in symbolic form, mined from corpus or specified by human experts. For any given source sentence, phraseNet scans the phrase memory to determine the candidate phrase pairs and integrates tagging information in the representation of source sentence accordingly. The decoder utilizes a m...
متن کاملNeural Phrase-based Machine Translation
In this paper, we present Neural Phrase-based Machine Translation (NPMT). Our method explicitly models the phrase structures in output sequences using SleepWAke Networks (SWAN), a recently proposed segmentation-based sequence modeling method. To mitigate the monotonic alignment requirement of SWAN, we introduce a new layer to perform (soft) local reordering of input sequences. Different from ex...
متن کاملTowards Neural Phrase-based Machine Translation
In this paper, we present Neural Phrase-based Machine Translation (NPMT). Our method explicitly models the phrase structures in output sequences using SleepWAke Networks (SWAN), a recently proposed segmentation-based sequence modeling method. To mitigate the monotonic alignment requirement of SWAN, we introduce a new layer to perform (soft) local reordering of input sequences. Different from ex...
متن کاملTowards Neural Phrase-based Machine Translation
In this paper, we present Neural Phrase-based Machine Translation (NPMT). Our method explicitly models the phrase structures in output sequences using SleepWAke Networks (SWAN), a recently proposed segmentation-based sequence modeling method. To mitigate the monotonic alignment requirement of SWAN, we introduce a new layer to perform (soft) local reordering of input sequences. Different from ex...
متن کاملTowards Neural Phrase-based Machine Translation
In this paper, we present Neural Phrase-based Machine Translation (NPMT).1 Our method explicitly models the phrase structures in output sequences using SleepWAke Networks (SWAN), a recently proposed segmentation-based sequence modeling method. To mitigate the monotonic alignment requirement of SWAN, we introduce a new layer to perform (soft) local reordering of input sequences. Different from e...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Shizen gengo shori
سال: 2022
ISSN: ['1340-7619', '2185-8314']
DOI: https://doi.org/10.5715/jnlp.29.587